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ProgFamming aid enabling source code to be inewed with a general purpose document browser 



(57) Disclosed is a programming aid for enabling 
computer program source code to be viewed using a 
general purpose document browser responsive to a 
defined set of control tags determining the appearance of 
a document the aid comprising: a parser for generating a 
symbol table from an Input file in which the program is 
stored; means to derive control tag locations from the 
symbol table; and an output file generator for inserting, 
530, control tags Into the program at the denved locations 
*o generate an output file which can be viewed using the 
general purpose document browser. In this way, the full 
power of user-chosen general purpose document 
browsers may t>e used in software development and 
maintenance to view programs in a convenient way. 
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2312973 



PROGRAMMING AID 

TBCHNICAIi FIBIiD ^ 

This invention relates to computers and, more psurticularly, to a 
programming aid £or enabling a computer program in source code form to be 
viewed using a general purpose document browser « 

BACKC^OUND ART 

Most programmers are £aced at some time or another with the 
prospect of imder standing the structure of existing software, whether it 
be for maintenance or development. More often than not, program 
documentation is insufficient and a prograimner is forced to e x a m i n e 
actual source code for this task. For large and complex programs this 
exercise is time consuming and error prone. Consequently, tools to aid 
in the efficient and effective inspection of programs are very valuable. 
One type of such tools is program browsers . 

Program browsers are sometimes provided with modem congpiier 
products amd require a language dependent program processor, whose 
functionality includes fully- fledged parsing of the source code as well 
as some semantic analysis ^ Also desirable is the provision of 
sophisticated and efficient display and search capabilities, together 
with a convenient user interface. 

However, at the present time the very rapid cha ng es and 
developments in the field of browser technology has meant that providing 
program development tools which keep pace with the latest developments in 
browser technology . can be a real problem. In addition, changes in 
programming languages give rise to the need for new browsers . 

Also available are document browsers which take as input a document 
which includes certain control tags , interpret the tags and present the 
document to the user on a computer screen. Examples of such browsers are 
the IBM BookManager software product from IBM Corp, which uses the - 
BoolCMaster tag language and the various browsers which are available to 
view documents which en^ploy HTML tags . The HTML tag language is used by 
documents which make up the world wide Web. 
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DISCIiOSXTRB OF THE INVENTION 
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The present invention seeks to pirovide an inyproved program browser 
by providing a progrannaing aid for enabling a con^uter program in source 
code form to be viewed using a general purpose document browser, which 
browser is responsive to a defined set of control tags determining the 
appearance of a document thereon, the programming aid comprising:, a 
parser for generating a symbol table from an input file in which the 
program is stored; means to derive control tag locations from the symbol 
table; and an output file generator for inserting control tags into the 
program at the derived locations to generate aji output file which can be 
viewed using the general purpose document browser. 
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In this way, the full power of available general purpose document 
browsers may be used in software development and maintenance to view 
programs in a convenient way. A tool is provided tliat automatically 
marks up a program with co ntro l tags so J^at users are free to u se the 
document browser of their choice as a powerf ul program browser . This 
gives the user the flexibility and convenience of using a browser with a 
user interface or other features that they may prefer, or with which they 
are familiar, and means that they are not necessarily forced to leani a 
new user interface each time they use a new programming system. 
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Furthermore, the invention reduces the burden on compiler and * 
programming language developers since it potentially removes the need, to 
reimplement sophisticated display and search capabilities and the user 
interface of a dedicated program browser each time a new programming 
language is introduced. 
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If the control tags axe the HTML control tags, or tags similar in 
form and. purpose, the invention provides a form of distributed program 
browser whereby programs can be conveniently viewed by a number of 
different users at remote locations over the World wide web. 



35 



BRIEF DESCRIPTION OF DRAWINGS 



An embodiment of the invention will now be described by way of 
exan^le only with reference to the accon^euiying drawings, wherein: 
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Fig. 1 is a schematic diagram showing a programming aid; 
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Figs 2A. and 2B are flow diagrams showing the general operation of 
the programming aid; 

Fig 3 is a flow diagram showing a phase 1 parsing process; 

Fig 4 is a flow diagram showing a build lin]cs process; 

V " , . . .... _ - ■ - 

Fig 5 is a flow diagram showing a phase 2 parsing process « 

10 BEST MODB(S) FOR CARRYING OUT THE INVENTION 

Fig 1 is a schematic diagram illustrating the main functional 
components of a programming aid in one embodiment of the invention. The 
apparatus comprises a parser 100 which is arranged to receive an input 

15 file 110 including a computer program in soiirce code form. The parser 

generates a symbol table 120. An output file generator 130 uses the 
information in the symbol table to parse the input file for a second time 
and to generate an output file 140 whicdi includes control tags inserted 
at certain locations to enable the program code to be viewed using a 

20 general purpose document browser which is responsive to the control tags, 

the tags determining the appearance of the program on the browser. 

Figs 2A emd 2B together form a flow diagram illustrating the 
overall operation of the system^ The process staxts at step 200. The 
25 various data structures used by the program are initialised in step 210 

and a prolog inserted in an output file in step 220. Parser 100 then 
performs a first paxsing phase in step 230. This phase is described in 
more detail below. This paxsing step is repeated for any further input 
files following loop 240. 

30 

The aim of the parsing step is to generate symbol table 120. 
Symbol table 120 is in essence a conventional symbol table as would be 
generated by the parser of a conventional compiler although for this 
application not all the information that i conventional compiler symbol 
35 table contains may: be needed. The symbol table contains all the tokens 

found while scanning the input files. For every token, a set of 
^ attributes associated with the token is kept. 

In addition, the inter-relations among different tokens are kept in 
^0 the table. An exan^le of a Token is a keyword (e.g., IF, ELSE, FOR), a 

variable name, a routine name, etc. An example of interrelation is a 



CALL relation (routine X is calling routine Y) , a CONSTRUCT relation 
(e.g., iLSE X is part of an IF statement that begun %rf.tli IF Y) , etc. 

It will be understood by those skilled in the art that there are 
many ways to implement the table and in most implementations the table 
would be a. combination of several primitive data structures, such as 
arrays, trees, linked lists, or the like. The detailed structure of a 
symbol table will be well understood by those skilled in the art and need 
not be discussed in any further detail herein. 

Fig 3 is a flow diagram illustrating the phase 1 parsing step. A 
pointer is initialised at the beginning of the input file in step 300. 
Data is read in step 310 until a beg inning -of -construct or end-of- 
construct delimiter is identified. The beginning-of -construct delimiter 
can simply be the ASCII code representing a space followed an ASCII 
character representing a letter of the alphabet, the end- of -construct 
delimiter can be for instance the ASCII code representing a space. This 
will, of course, depend upon the particular programming language 
concerned- Once a beginning -of -construct and end-of -construct pair is 
identified, the data between them is recognised as a token and vnritten to 
the symbol table 120 in step 320- 

It will be understood that parser 100 is program language Specific 
and would need an in«)lementation for each programming language supported 
by the tool. The other coii«>onents of the tool could be iH«>lemented in a 
manner which is common to different programming languages. The tool 
could be provided with more than one parser 100 so that it can support 
more than one prograaming language. 

After the phase 1 parsing step 230, a build links process 250 is 
performed. This process adds to Symbol table 120 all the information 
regarding the interrelations among different Tokens. 

Fig. 4 is a flow diagram illustrating the build links process. 

For every token in the table, the BUILD LINKS Process fijids all the 
tokens that reference it in step 400 and adds the attributes of the 
referencing tokens to the entry of the referenced token in the table in 
step 410- 
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The BUILD LINKS Process then identifies all the referencing tokens 
in the symbol table in step 420. For every referencing token, it ; 
identifies the associated referenced token in step 430 and adds the 
attributes of the referenced token to the entry of the referencing token 
dLn the table in step 440- ^ 

Bxainples of a referencing token are a CALL statement > a MACRO usage 
or a vaxiable usage. Corresponding examples of a referenced token are a 
routine invoked by a CALL statement, a MACRO definition eind a variable 
definition, 

using the attributes that were recorded in the table during the 
BUILD LINKS process, a 'Phase 2 Parsing' Process shown as step 260 in Fig 
2B generates the appropriate Maxkup Language tags to create the necessary 
hypertext links between referencing objects and referenced objects. 

Fig 5 is a flow diagram illustrating the phase 2 parsing process. 
The file pointer is initialised to the beginning of the input file in 
step 500. The tokens from the table are read in the sequence in which 
they feattire in the input files in step 510, Data corresponding to the 
token is read from the input files in step 520. Then data is written to 
the output file in step 530 which includes the data from the input file 
and suitable control tags which are appropriate to the token type and, if 
SLppllcablB, any associated references. 

Once all iziput files have been processed in this manner, a suitable 
epilog is inserted at the end of the output file, see step 270 in Fig 2. 

The output file is then in a suitable format to be viewed using a 
general purpose document browser which sujpports the particular tag 
language for the inserted tags . 

Tt will be understood that output file generator 130 is specific to 
a particular tag language and would need an implementation for each tag 
language supported by the tool. The other con^nents of the tool could 
be isc^lemented in a manner which is common to different tag languages. 
The tool could be provided with more than one output file generator 130 
so that it can support more than one tag language. 

There are many different ways in vdiich the features of various tag 
languages may be used to enhance to appearance of program code. The 
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following are a few examples of the type of enHaacements that can be 
used. 

Highlighted tags can be used to apply special fonts and colours to 
different language constructs, such as keywords, constants and 
identifiers. Each function name can be marked with a heading tag. This 
enables ^a table of contents to be generated which includes a list of 
function names. Using hierarchical headings for nested functions can 
assist in visualising nested functions. 

vertical lines can be used to en^hasise block structures of 
programs. Hypertext links can be added which link the beginning and end 
of a program block and/or between a declaration of a program variable and 
its usage. In this way using certain document browsers, a mouse click on 
a variable can be used to immediately display its declaration. 

Each function call or goto statement can be linked to its target 
and/or indexed to enable an effective search of the calling locations of 
fxinctions or all labels. 

Deletion tags can be used to mark code which has been commented 
out. Normal text control tags, eg paragraph markers or the like, can be 
used in comment statements to obtain text of the normal quality found in 
publications. 

comments in the program can be linked to other related documents 
stored in softcopy form, such as user documentation or functional 
specification documents, as well as other information sources on the 
world wide Web. 

A summary of the above types of markup element and the tags used in 
the IBM BookMaster language is shown in Table I. 

A description of the IBM BookMaster tag language can be found in 
the IBM Publication number GC34 - 5006 - 6 ' BookMaster General Information' 
(1993) and other published IBM manuals associated with the product. 



TABLE I 



Program Element 


Tag Type 


Example 


Language construct 


Highlighted phrase tag 


:hpl - :hp9 


Function' name 


Hierarchical headings 
tag 


:h2 ' :h20 


Blocks structure 


Vertical lines 
connecting the 
beginning of a block 
with its end or 
hypertext link 


^cbxul . , &bxv, £bxll • T 
ispot. / .•spotre£. 


Declarations of 
program variables 
r*cvn^^ piT\^ ^ OT" ind.cros 


Indexing tags and 
associative links 




Function call or Goto 
statement 


Indexing tags for 
seaxches only 


:±1 ±xiclude=sBarch 


Function call or goto 
statement 


Hypertext links 


: spot . / : spotref 


Commented out code 


Meirked Deletion 


:md 


Other related 
Documentation. 


Hypertext links 


sdocdesc. , :autolxhk.. 



Once the marked up program file is generated, it can be used in all 
those operating systems which sure supported by the sof tcopy system* For 
example, the IBM BookManager system can be used in the VM, DOS, OS/2, AIX 
and MVS operating system environments* The environment which is used by 
the programming tool to generate the marked up file does not limit the 
environment in which it can be viewed. It is possible to parse the input 
code in one operating environment to be viewed in another. Use of the \ 
HTML tags language would enable a program to be view using the World- 
Wide -Web thus providing a form of distributed browser. Further 
information about the World Wide Web can be found in "Spinning the Web" 
by Andrew Ford (International Thomson Publishing, London 1995) and "The 
World Wide Web Unleashed" by John December and Neil Randall (SAMS 
Publishing, Indianapolis 1994) . 
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There has been described a new approach to developing program 
- . browser using front-end compiler technology and existing general purpose 
document browsers, which already have efficient browsing features and 
sophisticated user interfaces. This approach eases significantly the 

5 effort reauired to implement a program browser for new programming 

languages and new programming systems and enables all the power of 
existin:g document browsing systems to be employed to enable programs , to 
be examined. It will be noted that the source code to be processed using 
the programming tool does not need to be modified in any way by the 

10 process - 

AS Will be clear from the above description, the present 
implementation takes the form of a computer program and can be 
distributed in the form of an article of manufacture comprising a 
computer usable medium in which suitable program code is embodied for 
causing a computer to perform the function of the programming aid 
described above • 

The prograiaming aid could take tbe form of a 'stand-alone' system 
or could be implemented as part of a compiler or other programming tool 
which is provided with the facility to export marked up files for viewing 
using a general purpose browser. 
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XNDUSTRXAIj APPIiICABIIjITT 

The invention is applicable to the industrial field of computers 
and computer program development systems* 



CIAIMS 



1. A programming aid for enabling a con^uter program in soiirce code 
form to be viewed using a general, purpose document browser, v^ich browser 
is responsive to a defined set of control tags determining tbe appearance 
of a document tbei^eon, the programming aid comprising: 

a parser for generating a symbol table from an input file in wbich 
the program is stored; 

means to derive control tag locations from the symbol table; 

an output file generator for inserting control tags into the 
program at the derived locations to generate an output file which can be 
viewed using the general purpose docximent browser • 

2. A programming aid as claimed in claim 1 con«)rising means to 
generate, and to record in the symbol table, linlcs between constructs in 
the program and \rtierein at least some of the control tags are arranged to 
link locations in the output file corresponding to such linked 
constructs • 

3. A programming aid as claimed in claim 1 or claim 2 wherein at least 
some of the control tags form hyper te^ 

4 / A programming aid as claimed in any preceding claim wherein the 
control tags are BookMana ger control tags or HTML control tags. 

5. A programming aid as claimed in any preceding claim in the form of 
a compiler having the facility to export files which include the control 
tags. 

6. A programming aid as claimed in any preceding claim con«>rising a 
plurality of parsers for supporting a plurality of programming languages. 

7. A programming aid as claimed in any preceding claim comprising a 
plurality of output file generators for supporting a plurality of control 
tag languages. 
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